Experiments with Cholesky Factorization on Clusters of SMPs
نویسندگان
چکیده
Cholesky factorization of large dense matrices is an integral part of many applications in science and engineering. In this paper we report on experiments with different parallel versions of Cholesky factorization on modern high-performance computing architectures. For the parallelization of Cholesky factorization we utilized various standard linear algebra software packages and present performance results on SMP clusters and shared-memory cc-NUMA machines. Clusters of SMPs can be characterized as hybrid parallel architectures which combine the main architectural features of distributed-memory and shared-memory parallel computers. Although the availability of SMP clusters is increasing rapidly within the scientific computing community, currently no generally accepted programming model exists for these machines. As a consequence, most application developers utilize pure distributed-memory programming models, usually based on the message passing interface (MPI), and thus may miss a number of optimization opportunities offered by the shared-memory available within the nodes of a cluster. In order to address these issues, we have experimented with different parallelization strategies for Cholesky decomposition comparing pure message passing strategies to a hybrid parallelization strategy that combines message passing with shared-memory parallelization based on multi-threading.
منابع مشابه
Incomplete Cholesky Parallel Preconditioners with Selective Inversion
Consider the solution of a large linear system when the coe cient matrix is sparse, symmetric, and positive de nite. One approach is the method of \conjugate gradient" (CG) with an incomplete Cholesky (IC) preconditioner (ICCG). A key problem with the design of a parallel ICCG solver is the bottleneck posed by repeated parallel sparse triangular solves to apply the preconditioner. Our work conc...
متن کاملParallel and fully recursive multifrontal sparse Cholesky
We describe the design, implementation, and performance of a new parallel sparse Cholesky factorization code. The code uses a multifrontal factorization strategy. Operations on small dense submatrices are performed using new dense matrix subroutines that are part of the code, although the code can also use the blas and lapack. The new code is recursive at both the sparse and the dense levels, i...
متن کاملA Multilevel Block Incomplete Cholesky Preconditioner for Solving Rectangular Sparse Matrices from Linear Least Squares Problems
An incomplete factorization method for preconditioning symmetric positive definite matrices is introduced to solve normal equations. The normal equations are formed as a means to solve rectangular matrices from linear least squares problems. The procedure is based on a block incomplete Cholesky factorization and a multilevel recursive strategy with an approximate Schur complement matrix formed ...
متن کاملPreconditioning of wavelet BEM by the incomplete Cholesky factorization
The present paper is dedicated to the preconditioning of boundary element matrices in wavelet coordinates. We investigate the incomplete Cholesky factorization for a pattern which includes also the coefficients of all off-diagonal bands associated with the level-level-interactions. The pattern is chosen in such a way that the incomplete Cholesky factorization is computable in log-linear complex...
متن کاملOn Positive Semidefinite Modification Schemes for Incomplete Cholesky Factorization
Incomplete Cholesky factorizations have long been important as preconditioners for use in solving largescale symmetric positive-definite linear systems. In this paper, we focus on the relationship between two important positive semidefinite modification schemes that were introduced to avoid factorization breakdown, namely the approach of Jennings and Malik and that of Tismenetsky. We present a ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005